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Abstract 

A  simple  model  problem  in  exploration  seismology  requires  that  a  depth- 
varying  sound  velocity  distribution  be  estimated  from  reflected  sound  waves. 
For  various  physical  reasons,  these  reflected  signals  or  echoes  have  very  small 
Fourier  coefficients  at  both  very  high  and  very  low  frequencies.  Nonetheless, 
both  geophysical  practice,  based  on  heuristic  considerations,  and  recent  nu¬ 
merical  evidence  indicate  that  a  spectrally  complete  estimate  of  the  velocity 
distribution  is  often  achievable.  We  prove  a  theorem  to  this  effect,  show¬ 
ing  that  “sufficiently  rough”  velocity  distributions  may  be  recovered  from 
reflected  waves  under  some  restrictions,  independently  of  the  very  low-  or 
high-frequency  content  of  the  data.  The  main  restriction  is  that  the  velocity 
depend  only  on  a  single  (depth)  vartiable;  only  in  this  case  are  sufficiently 
refined  propagation-of-singularity  results  available.  The  proof  is  based  on  a 
novel  variational  principle,  from  which  numerical  algorithms  have  been  de¬ 
rived.  These  algorithms  have  been  implemented  and  used  to  estimate  velocity 
distributions  from  both  synthetic  and  field  reflection  seismograms. 


1  Introduction 


A  simple  model  of  the  physical  setting  for  reflection  seismology  is  constant- 
density  linear  acoustics ,  in  which  the  sound  velocity  field  c(x)  ( x  €  R3)  is 
connected  to  the  pressure  field  u(x,t )  via  the  wave  equation: 


1 

c2(x) 


d2u(x, t ) 


dt 2 


—  Au(x,t)  =  f(t)S(x) 


u  =  0,  t  <  0 


The  right-hand  side  represents  an  isotropic  point  dilatational  energy  source 
radiating  with  time  varying  (transient)  intensity  fit)  (“the  source  wavelet”). 
The  seismogram  is  a  sampling  of  the  pressure  u  at  a  number  of  “receiver” 
points.  We  adopt  the  idealization  that  these  points  form  the  continuum 
{x3  =:  z  =  0}  (“the  surface  (of  the  earth)”)  and  that  the  measurement  of 
u  is  also  continuous  in  time  for  some  time  interval  0  <  t  <  imax-  Regarding 
the  source  (i.e.  f{t))  as  known,  the  pressure  field,  hence  the  seismogram, 
becomes  a  function  of  the  sound  velocity: 

'•=  «l2=o 

In  this  simple  model,  the  fundamental  problem  of  reflection  seismology  is 
to  estimate  c  from  s[c],  i.e.  to  solve  a  functional  equation  of  the  form 

■S[c] 

possibly  in  some  least-error  sense  accommodating  the  possibility  (virtual  cer¬ 
tainty!)  of  inconsistent  data  error. 

This  model  is  grossly  inadequate  for  some  practical  purposes,  as  it  ignores 
significant  physics  of  seismic  wave  generation  and  propagation.  Nonetheless, 
it  forms  the  basis  for  most  contemporary  seismic  data  processing  (see  for 
instance  Yilmaz  (1987)),  and  it  exhibits  two  fundamental  features  of  the 
“real”  problem: 

(i)  s [c]  is  nonlinear  in  c; 

(ii)  f  should  be  chosen  to  suppress  Fourier  components  in  s[c]  at 
“low”  and  “high”  frequencies. 
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Item  (i)  is  simply  the  nonlinearity  of  solutions  of  linear  equations  as  func¬ 
tions  of  their  coefficients.  Item  (ii)  is  required  by  observations  of  the  spectra 
of  reflection  seismograms:  for  various  physical  reasons,  Fourier  components 
at  very  low  (<  4Hz)  and  very  high  (>  80  Hz)  temporal  frequencies  are  essen¬ 
tially  missing  from  real  reflection  seismograms. 

The  suppression  of  high-frequency  components  simply  means  that  c  t— ► 
s[c]  is  a  smoothing  operator.  Techniques  for  management  of  the  resulting 
high-frequency  instability  are  well-known  —  see  e.g.  Tihonov  and  Arsenin 
(1976),  Miller  (1970),  Payne  (1974). 

In  contrast,  the  instability  resulting  from  the  lack  of  low-frequency  data 
has  been  little  discussed  in  the  mathematical  literature  on  inverse  problems, 
even  though  it  is  nearly  ubiquitous  in  real  world  parameter  estimation  prob¬ 
lems  based  on  wave  propagation.  This  low-frequency  lacuna  is  a  striking 
feature  of  reflection  seismology,  in  particular,  and  the  possible  ambiguities 
resulting  from  spectral  incompleteness  of  data  have  sparked  considerable  dis¬ 
cussion  within  the  geophysical  research  community. 

The  present  paper  is  devoted  to  the  proof  of  a  uniqueness  and  continuous 
dependence  result  for  a  restricted  version  of  the  inverse  problem  described 
above,  in  which  the  estimates  are  independent  of  the  behaviour  of  / (u>)  near 
u)  =  0.  We  shall  show  that  c  is  well-determined  by  s[c]  when  c  is  sufficiently 
non-smooth. 

This  rather  strange  sounding  requirement  is  natural  in  view  of  the  appli¬ 
cation  to  reflection  seismology:  rapid  changes  in  the  mechanical  properties 
of  rock  are  entirely  responsible  for  the  return  of  substantial  echoes  to  the 
surface,  hence  for  the  information  content  of  seismic  reflection  data.  The 
times  of  arrival  of  these  echoes  —  or  rather,  the  signals  which  simulate  them 
in  the  model  described  above  —  carry  information  about  the  slowly  varying 
components  of  c,  completely  independently  of  the  low-frequency  behavior  of 
/.  On  the  other  hand,  the  identifiability  of  these  echoes  depends  on  the 
wave  nature  of  the  seismic  disturbance,  in  other  words  on  the  propagation 
of  singularity  (or  regularity),  according  to  geometric  optics.  This  propaga¬ 
tion  property  clearly  requires  some  smoothness  of  the  coefficients;  thus  the 
conditions  necessary  for  the  creation  of  strong  reflected  signals  are  in  tension 
with  those  necessary  for  their  propagation. 

We  show  in  this  paper  that  this  tension  can  be  resolved  at  least  in  a  special 
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case.  The  resolution  requires  certain  estimates  concerning  propagation  of 
regularity  currently  available  only  under  the  additional  constraint: 

The  sound  velocity  c  is  a  function  only  of  the  “depth”  variable 
x3(=  z). 


That  is,  the  result  detailed  here  applies  to  layered  fluid  models  only.  The  nec¬ 
essary  technical  results  for  this  class  of  models  were  established  in  previous 
papers  of  the  author  (Symes  1981,  1983,  1986a,  b).  These  are  essentially  en¬ 
ergy  estimates  and  are  related  to  earlier  work  of  Rauch  and  Taylor  (1974)  and 
Kreiss  (1967)  on  mixed  problems  for  linear  hyperbolic  systems  in  two  inde¬ 
pendent  variables.  Other  authors  basing  results  about  1-D  inverse  problems 
for  hyperbolic  equations  on  the  same  ideas  include  Fawcett  (1984),  Suzuki 
(1988). 

A  few  remarks  concerning  the  prospects  for  weakening  the  layered  medium 
assumption  in  these  arguments  may  be  found  in  the  next  section. 

The  paper  is  organized  as  follows:  Section  2  gives  a  precise  statement  of 
the  main  results,  a  brief  review  of  the  literature,  and  discussion  of  related  is¬ 
sues.  Section  3  introduces  the  plane-wave  decomposition  and  estimates  for  the 
plane- wave  problems  from  the  author’s  previous  work;  this  material  forms  the 
technical  basis  for  the  rest  of  the  paper.  As  we  are  concerned  mostly  with  the 
information  independent  of  the  low-frequency  behaviour  of  /,  we  introduce 
in  Section  4  the  (temporary)  assumption  that  /  is  a  compactly  supported 
measure,  defining  an  elliptic  convolution  operator  bounded  on  L2(R).  Under 
this  assumption,  we  prove  an  estimate  of  Garding  type  for  the  derivative 
of  the  plane- wave  seismogram  map.  In  order  to  do  more,  it  is  necessary  to 
consider  the  various  plane-wave  problems  simultaneously.  Each  plane-wave 
model  is  parameterized  by  the  vertical  plane- wave  velocity,  viewed  as  a  func¬ 
tion  of  (its  own)  travel-time.  These  models  are  a  priori  independent.  We 
derive  a  coherency  condition  in  Section  5,  equivalent  to  the  existence  of  a 
(single)  velocity  profile  c(z)  from  which  all  of  the  plane-wave  models  are  de¬ 
rived  (i.e.  “every  (plane  wave)  experiment  sees  the  same  earth”).  In  Section 
6  we  show  that  a  least-squares  version  of  the  inverse  problem  stated  above, 
posed  in  terms  of  the  plane-wave  models  and  augmented  by  the  coherency 
condition  (as  a  so-called  penalty  term)  has  a  positive-definite  Hessian  (second 
derivative)  operator  at  a  consistent  (zero-residual)  solution  provided  that  the 
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corresponding  velocity  profile  is  sufficiently  rough,  as  stated  above.  Our  main 
result  follows  immediately  via  the  implicit  function  theorem.  So  far  we  have 
maintained  the  elliptic  assumption  concerning  /;  this  is  dropped  in  Section 
7,  for  the  usual  price,  paid  for  the  solution  of  compact  operator  equations, 
of  a  priori  constraints  on  the  smoothness  of  c. 


1.4 


2  Statement  of  Main  Result,  Discussion. 


In  this  section  we  give  a  precise  statement  of  the  major  result  of  this  paper, 
followed  by  a  brief  review  of  the  literature  and  a  conceptual  overview  of  the 
problem  described  above. 

Since  the  principal  goal  of  the  present  work  is  the  production  of  a  “so¬ 
lution”  of  the  inverse  problem  stated  above  with  continuity  properties  inde¬ 
pendent  of  the  low-frequency  behavior  of  the  source  wavelet  /,  we  make  the 
temporary  assumption  that: 


/  G  £'(R)  is  a  Borel  measure,  defining  a  bounded  elliptic  convolu¬ 
tion  operator  on  R):  i.e.  for  positive  Kq,  K\,  K* ,  <f>  G  Hl{ R) 


Kx 


d± 

dt 


< 


f 


LHR) 

d<f) 
dt 


-  Ko  l|^lli,2(R) 


Also,  supp /  C  {1GR:1>0}. 


(2.1) 


Such  a  distribution  necessarily  has  a  finite  first  moment 


rrij  =  sup 


(. fM ) 


The  “elliptic”  assumption  (2.1)  will  be  weakened,  to  the  extent  possible,  in 
Section  7.  The  simplest  example  of  wavelets  (i.e.  kernels)  /  having  the 
elliptic  property  (2.1)  are  obtained  by  subtracting  from  a  slightly  shifted 
Dirac  delta  function  a  smooth  approximation. 

The  conventional,  though  perhaps  ill-founded,  choice  of  measure  for  the 
seismogram  error  is  some  weighted  version  of  the  T2-norm  —  see  Tarantola 
(1987),  Chapter  6.  We  shall  adopt  this  choice  also.  The  elliptic  nature  of  / 
precludes  square-integrability  of  s[c],  however,  as  can  easily  be  seen,  so  we 
choose  what  amounts  to  a  very  singular  “weight”:  we  define,  for  suitably 
small  slowness  p, 


d 

5[c](t,p)  =  J  J  dxldx2-^s[c}(x1,x2,t  +  pxf) 
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That  is,  5  is  a  version  of  the  Radon-transform  of  s,  which  we  shall  further 
restrict  to  a  rectangle  {(t,p):  0  <  t  <  Ti, Pi  <  p  <  P 2}  =:  R\.  It  is  easily 
seen  that,  for  smooth  c,  any  Ti  >  0,  and  P2  sufficiently  small,  S  is  square- 
integrable  (Santosa  and  Symes  (1988)).  Thus  the  basic  data  set  of  this  paper 
will  be  a  member  of  L2(Ri). 

Another  piece  of  notation  needed  to  state  our  main  result  is  used  in 
our  method  of  measuring  “roughness”:  as  mentioned  in  the  introduction,  a 
stable  solution  of  the  inverse  problem  can  only  be  expected  for  sufficiently 
non-smooth  coefficient  c. 

The  “roughness”  measure  depends  on  an  arbitrary  Dirac  kernel  hi  € 
C£°( R)  satisfying 

hi  >  0,  h j(0)>0,  Jh i  =  l 

Set 

K(z)  =  hi(-J 

For  Z0,  e,  A  >  0,  c  e  Hioc,  define 


r[c](Z0,  A) 


sup 

0<2<Zo 


A 


2 


2  2^_  A 

r*[c](Z0,e,A)  =  sup  —  f  *  \h[  *  c'|2 
0<z<Z0  A  Jz-  f 


These  are  local  average  measures  of  fluctuation.  For  example,  r.  is  a  siz¬ 
able  fraction  of  f ,  r*  when  c  has  significant  Fourier  components  at  frequencies 
proportional  to  locally  uniformly  on  the  length  scale  A. 

The  geometry  of  the  plane-wave  problem  which  will  occupy  most  of  this 
paper  is  determined  by  the  travel-time  function 


which  gives  the  time  necessary  for  a  point  on  a  planar  wavefront  at  slowness 
p,  fixed  horizontal  coordinates,  to  travel  to  depth  z  and  back  to  the  surface. 
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The  earlier  results  of  the  author  (Symes  1986a, b)  imply  that  S  extends  to 
a  bounded,  continuous  map  on  the  bounded  set  C  H\oc( R)  parameterized 
by  positive  numbers  To,  Pi,  Co,  c\,  c*,  according  to 

Ec  =  {c£  H(oc(R)  :  c(z )  =  co,z  <0;  for 

Z0>  0  so  that  T0  =  t(Zq,Pi), 

II  i°g  cIIhi[o,z0]  ^ c*; 

c(z)  =  ci  for  z  >  Zq} 

We  have  also  shown,  however,  that  this  extension  is  not  locally  Lipschitz- 
continuous,  and  certainly  not  differentiable,  in  these  metrics.  5  does  become 
differentiable  when  the  domain  is  metricized  more  strictly  (TT3),  but  then  the 
derivative  fails  to  have  a  lower  bound.  Thus  the  implicit  function  theorem 
does  not  apply  to  the  solution  of  least-squares  problems  for  S.  The  compu¬ 
tational  consequences  of  this  pathology  are  also  striking  (Santosa  and  Symes 
1986). 

A  suitable  family  of  “rough”  subsets  E^  of  Ec  depends  on  positive  param¬ 
eters  Mi,  M2,  e  and  A  according  to 

E'  =  {c  G  Ec  :  for  Z0  >  0  such  that 

To  =  t(Zo,  Pi)  and  some  0<e<e,  0<A<A, 

the  following  inequalities  hold: 

Mi  <  r„(Z0,e,  A), 

M2r,(Zo,  e,  A)  >  max(f(Z0,  A),  r*(Z0,  e,  A))} 

We  shall  verify  that  E'c  is  nonempty  for  suitable  choices  of  parameters. 

The  main  result  of  our  paper  is: 

Theorem  1  Suppose  that  0, To  <  T\,  0  <  Pi  <  P2,  0  <  K0,  0  <  K\  <  K* 
0  <  Co,Ci,c*  are  given.  Then  there  exist  constants  Mi,M2,e,  A,m,  and  L* 
depending  on  T0,Ti,  Pi,P2,  Ko,  K\,  K* ,  co,ci,  and  c*  so  that  if  f  €  ^(R) 
satisfies  (2.1)  and 

mj  <  fh 
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then  S'  is  nonempty  and  there  exists  an  open  neighborhood  U  of  the  set 

{S[c]  6  L\R ,)  :  c  €  S'J 

and  a  map 

I-.U—r  iL(R) 

so  that 

(i)  for  c  G  S',  /S[c]  =  c; 

(ii)  for  D1,D2  G  U,  Z\  >  0 

117(1),)  -  /(D2)||wM| 

<  -  D2\\l^{Ri) 

Thus  we  obtain  a  continuous  left-inverse  for  S ,  under  various  constraints. 
The  requirement  that  c  be  constant  for  large  2  is  simply  a  way  of  controlling 
c  at  depths  below  the  zone  influencing  the  seismogram.  That  the  zone  of 
constancy  begins  at  Z0  such  that  To  =  t(Zo,  Pi),  rather  than  T\  =  t(Zq,  Pi), 
is  an  unfortunate  side-effect  of  the  “width”  of  the  source  wavelet  /:  since 
supp  f  is  not  a  point,  the  depth  interval  in  which  the  seismogram  gives 
sure  control  over  the  velocity  coefficient  is  strictly  smaller  than  the  depth 
interval  needed  to  compute  the  seismogram.  Given  an  arbitrary  velocity 
profile  with  mean  ci  near  z  =  Zq,  one  can  of  course  truncate  it  to  a  member 
of  Ec,  i.e.  by  setting  the  velocity  constant  (=  ci)  for  z  >  Z0,  as  in  the 
definition.  The  corresponding  seismograms  are  then  different  only  in  the 
“gap”  (T0  <  t  <  Ti).  If  the  original  profile  obeys  the  uniform  roughness 
conditions  as  in  the  definition  of  £',  then  it  follows  from  arguments  similar 
to  those  in  Sections  3  and  4  that  the  I2-norm  of  the  difference  of  seismograms 
is  0(m )).  The  theorem  then  gives  the  same  qualitative  estimate  for  the  error 
due  to  application  of  I .  We  leave  to  the  reader  the  formulation  of  a  theorem 
embodying  this  extension  of  our  results. 

A  more  subtle  consequence  of  this  gap  is  that  the  value  of  c  in  this  “base¬ 
ment”  region  must  be  specified  a  priori,  i.e.  the  condition  that  c(z)  =  c\  for 
2  >  Z0.  It  seems  clear  that  this  additional  piece  of  data  should  have  little 
influence  on  the  values  of  c  at  shallower  depths  —  and,  to  the  extent  that  it 
does,  should  be  determined  by  S  as  well.  This  may  be  a  fruitful  subject  for 
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further  work;  some  related  ideas  are  discussed  in  Sacks  and  Santosa  (1987). 
In  any  case  the  author  does  not  see  at  present  how  to  formulate  a  convenient 
theorem  without  such  a  restriction. 

It  is  easy  to  see  that  the  Lipschitz  estimate  (ii)  cannot  be  strengthened 
much.  See  Symes  (1986b)  for  instance.  In  particular  it  is  not  possible  to 
replace  L 2  by  Hl  on  the  left-hand  side. 

Estimates  of  the  sort  presented  in  Theorem  1  are  only  of  qualitative  im¬ 
portance.  Numerical  evidence  (Symes  1988b, c,d)  indicates  that  typical  values 
of  the  Lipschitz  constant  L*  are  very  large.  On  the  other  hand,  restriction 
to  a  submanifold  of  E'  of  small  codimension  diminishes  L*  to  a  useful  mag¬ 
nitude,  while  leaving  enough  freedom  in  the  model  that  some  information 
about  c  is  still  obtained  from  the  data.  Analysis  of  an  approximation  to  S  in 
Symes  1988a, b  illustrates  this  feature.  A  full  understanding  of  the  need  for 
this  “residual  regularization”  is  still  lacking  at  this  writing. 

The  significance  of  the  present  result  lies  in  its  reliance  on  the  implicit 
function  theorem:  i.e.  the  stability  follows  directly  from  linearization  stabil¬ 
ity,  and  any  residual  ill-conditioning  can  be  improved  by  the  straightforward 
addition  of  linear  constraints.  This  is  surprising  —  and,  perhaps,  of  practi¬ 
cal  importance  —  since  the  implicit  function  theorem  cannot  be  applied  to 
S  directly,  as  noted  above,  for  elliptic  /. 

The  left  inverse  I  will  be  produced  via  the  solution  of  an  auxiliary  least- 
squares  problem,  developed  in  Section  5.  In  Section  7  we  remove  the  elliptic 
requirement  on  /,  to  a  certain  extent:  for  suitable  /  €  Co°,  we  obtain  an 
approximate  left  inverse  to  replace  /,  which  depends  on  a  choice  of  regular¬ 
ization. 

We  conclude  this  section  with  a  brief  review  of  the  literature  and  of  the 
background  of  the  present  approach.  Recent  papers  treating  inverse  prob¬ 
lems  for  hyperbolic  equations  include  Blagovaschenski  (1974),  Bamberger  et 
al.  (1977,  1979),  Bube  and  Burridge  (1983),  Burridge  (1980),  Carroll  and 
Santosa  (1982),  Santosa  and  Schwetlick  (1982),  Bruckstein  et  al.  (1983), 
Ramm  (1987),  Fawcett  (1984),  Symes  (1986a,  b),  and  Suzuki  (1988).  (This 
list  is  by  no  means  exhaustive.)  Most  earlier  papers  in  the  mathematical  lit¬ 
erature  on  inverse  problems  for  partial  differential  equations  concern  obstacle 
scattering  (Lax  and  Phillips  (1968),  Majda  and  Taylor  (1976))  or  quantum 
mechanical  inverse  scattering  problems  (Chadan  and  Sabatier  (1977)).  Prior 
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work  on  inverse  problems  in  wave  propagation  is  to  be  found  mostly  in  the 
geophysical  literature:  notable  examples  include  Goupillaud  (1961),  Ware 
and  Aki  (1969),  and  Gerver  (1970).  Note  however  that  the  seminal  paper  in 
this  field  —  Gel’fand  and  Levitan  (1951)  —  may  be  viewed  as  a  discussion 
of  an  inverse  problem  in  wave  propagation:  see  Symes  (1979). 

While  a  few  of  the  above  references  provide  rigorous  treatment  of  the  cen¬ 
tral  uniqueness,  existence,  and  continuous  dependence  issues,  none  treat  the 
“bandlimited”  problem  described  in  Section  1  in  a  satisfactory  way:  with¬ 
out  exception  these  works  either  assume  the  low-frequency  content  problem 
away,  or  ignore  it.  The  problem  is  well-known  to  geophysical  researchers, 
and  its  severity  is  explicitly  illustrated  in  Pao,  Santosa  and  Symes  (1984) 
and  Gray  and  Symes  (1985). 

Note  that  even  the  known  uniqueness  theorems  for  several- dimensional 
inverse  problems  in  wave  propagation  either  require  data  in  a  frequency  in¬ 
terval  [0,u;)  (Sacks  and  Symes  (1985),  Sun  (1987),  Ramm  (1986))  or  do  not 
allow  the  reflection  configuration  (sources  and  receivers  separated  from  the 
target  region  by  a  hyperplane  (Nachman  (1987),  Rakesh  and  Symes  (1988)). 
Thus  the  present  paper  is  the  only  instance,  to  the  author’s  knowledge,  in 
which  any  version  of  the  reflection  inverse  problem  has  been  shown  to  be 
well-posed  in  the  presence  of  a  low-frequency  lacuna. 

Nonetheless,  evidence  of  two  sorts  indicates  that  velocities,  at  least,  are 
quite  well-determined  by  bandlimited  data.  First  conventional  seismic  data- 
processing,  as  practiced  by  academic  and  industrial  reflection  seismologists, 
appears  to  produce  such  information.  While  based  on  numerous  drastic 
approximations,  the  so-called  “velocity  analysis”  procedures  incorporate  a 
great  deal  of  data-driven  insight;  see  for  example  Yilmaz  (1987),  Ch.  3.  A 
side-effect  of  our  work  is  to  provide  a  partial  but  rigorous  mathematical  basis 
for  these  important  techniques. 

Second,  recent  numerical  investigations  of  the  nonlinear  least-squares 
problem 

min  \\s[c]  -  Sdatalli,*  (2-2) 

C 

have  turned  up  more  direct  evidence  that  bandlimited  seismograms  deter¬ 
mine  velocity  profiles:  See  Kolb  et  al.  (1986),  MacAulay  (1985),  Gauthier  et 
al.  (1986),  Mora  (1987).  All  of  these  papers  also  reveal  that  any  reasonable 
setting  of  (2.2)  results  in  a  very  difficult  optimization  problem.  The  reasons 
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for  both  the  success  and  the  difficulty  of  this  so-called  “least-squares  inver¬ 
sion”  are  noted  briefly  in  Section  3,  and  explained  in  great  detail  in  Santosa 
and  Symes  (1986),  to  which  we  refer  the  reader  for  extensive  discussion.  In 
any  case,  the  computational  difficulty  of  (2.2)  was  the  main  motivation  for 
the  work  reported  here,  which  relies  on  a  different,  “relaxed”  least-squares 
problem  (Section  5). 

In  this  paper,  we  give  only  a  qualitative  analysis  of  this  “relaxed”  prob¬ 
lem,  which  we  call  the  coherency  optimization  problem ,  leading  to  Theorem 
1.  A  quantitative  analysis  of  an  approximation  appears  in  Symes  (1988a), 
and  numerical  experiments  are  reported  in  Symes  (1988b, c,d)  establishing 
the  feasibility  of  the  optimization,  its  relative  insensitivity  to  noise  and  its 
favorable  comparison  to  (2.2)  regarding  computational  efficiency.  In  Symes 
(1988d)  the  technique  is  applied  to  field  reflection  seismograms  with  quite 
satisfactory  results. 

A  very  important  remaining  question  concerns  the  extension  of  these  re¬ 
sults  to  other  models,  notably  to  non-layered  velocities  (i.e.  c  depending 
on  all  space  variables).  As  shown  in  Symes  (1988a,b),  an  approximation 
to  the  coherency  optimization  problem  can  be  formulated  for  the  general 
nonlayered  fluid  model.  A  full-blown  extension  of  our  results  awaits  bet¬ 
ter  understanding  of  propagation-of- regularity  for  hyperbolic  equations  with 
nonsmooth  coefficients,  and  implications  for  the  relation  between  solutions 
and  coefficients,  analogous  to  the  results  for  problems  in  two  independent 
variables  detailed  in  Section  3. 
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3  Preliminary  Considerations:  Reduction 
to  Plane  Waves,  Properties  of  the 
One- dimensional  Forward  Map 

We  assume  that  seismograms  are  given  on  an  open  set  0  of  the  space-time 
boundary  of  cylinder  form: 


Q  -  Q'  x  [0,  U] 

with  f Y  a  neighborhood  of  the  “source”  point  x  =  0.  We  shall  also  assume 
that  all  velocity  profiles  c  :  R  — >  R+  satisfy 

0  <  Cmin  <  c(z)  <  Cmax  ,  2  >  0  (3.1) 

for  a  priori  fixed  Whenever  convenient  we  will  also  think  of  (3.1) 

as  an  2,°°(R)-bound  on  log  c. 

The  principal  technical  device  of  this  paper  is  the  introduction  of  the 
Radon-transformed  field 

U(p,z,t):=  /  dx  u(x,z,t  +  p  •  Xi),  p  E  R. 

JR2 

Standard  arguments  show  that  U  is  well-defined  for  small  p,  t  under  the  as¬ 
sumptions  made  so  far.  For  an  attempt  to  maximize  the  domain  of  definition 
of  U,  see  Santosa  and  Symes  (1988). 

A  straightforward  calculation  shows  that  for  suitably  small  p  >  0  so  that 
Cma xP  ^  1 1 


i  OT,  „  aw,  „  „ 

^p)-aWiz’t]  ~  = 6{z)m 

U  =  0,  t  «  0 


(3.2) 


where  the  vertical  wave  velocity  v(z,p )  (or  u[c],  to  emphasize  the  dependence 
on  c)  is  defined  by 

r  u  \ 
v[c\{z,p)  = 


yj  1  -  c2{z)p2 
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Because  of  the  a  priori  bounds  (3.1),  the  support  of  u,  hence  of  s,  is  contained 

in  a  cone  _ 

Cmax*  >  \/lX|2  +  Z<1 

Therefore,  for  sufficiently  small  pmax(<  ^)  there  exists  -w  <  0  so  that  for 


R  =  {(t,p)  :  0  <  t  <  ^max  i  \P\  —  Pmax} 


we  have  for  ( t,p )  £  R 


SM(i.p)  :=  ^(P, 0,4) -/(<)  = 

du 

fRi  dx(ln-r^(x,z, t  +  pxi)  -  f(t  +  px,)6(x)) 


(3.3) 


i.e.  the  domains  of  integration  of  the  Radon  integrals  intersect  the  support 
of  u  inside  f l.  We  assume  tacitly  in  the  sequel  that  all  (t,p)  domains  satisfy 
this  constraint. 

From  standard  facts  about  the  Radon  transform  (e.g.  Helgason  [1980], 
Section  1.4)  it  follows  that,  if  s[c]  were  square-integrable,  we  would  have 


II W  :=JJRdtdpp\S[c}(t,p)\i 


(3.4) 


(see  alternatively  Santosa  and  Symes  (1986,  Appendix  B)).  As  noted  in  the 
introduction,  s[c]  is  not  square-integrable,  but  it  can  be  shown  that  a  version 
of  (3.4)  holds  in  which  the  r.h.s.  is  replaced  by  the  norm  of  a  “pseudodiffer¬ 
ential  projection”  of  s[c]. 

Now  we  recall  some  facts  about  the  one-dimensional  seismogram  map 
5[c](-,p)  =:  5o[f]  (fixed  p)  for  which  references  are  Symes  (1986a, b).  It  is 
convenient  to  include  explicitly  the  source  wavelet  temporarily  in  the  nota¬ 
tion.  That  is,  write  50[u,/]  for  the  map  defined  by  the  solution  of  (3.2), 
(3.3)  followed  by  restriction  to  fixed  p.  Recall  that  /  is  assumed  to  define  an 
elliptic  convolution  operator  of  order  zero.  For  the  choice  f  =  6,  So  defines  a 
bounded  continuous  map  from 

H1,+  ( R,t>0)  =  {u  G  tf^R)  :  v  =  v0  for  2  <  0,logv  <E  H{oc( R)} 
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into  L2[0,T]  for  any  T  >  0,  but  S0[-,6]  is  not  locally  uniformly  continuous 
(Symes  1986b).  In  order  to  recover  the  necessary  degree  of  regularity  for  the 
arguments  to  follow,  we  introduce  the  “travel-time  velocity”  v[c]  defined  by 

VO  T  =  V 


where 

fz  1 

r{z)  =  2/  - 
Jo  v 

is  the  (one-way)  travel-time.  More  discussion  of  the  map  c  u[c]  appears  in 
Section  5;  see  also  Symes  (1986a).  A  short  calculation  shows  that  U,  defined 

by 


U(T(z),t)  =  U{z,t) 


satisfies 


Set 


1  d2U ,  ^  d  1  dU,  ^  1  tu^t  , 

W)~5i(x' "  7jmx) 


■So[e,/](()  =  ^(0,0 


We  recapitulate  a  number  of  properties  of  So.  With  exceptions  noted  be¬ 
low,  all  of  these  may  be  found  in  Symes  (1986a).  Note  that  So[v,f]  = 
Sb[n,/]  if  v  o  r  =  v.  So  also  defines  a  bounded  map:  H^{R,v0)  — *  Iqoc(R) 
for  any  u0  >  0.  Moreover,  So  is  actually  of  class  C2,  viewed  as  a  map: 
H1’+([0,T/2],Vo)  -*  L2[0,T].  The  derivative  is  given  by  the  formal  pertur¬ 
bation  ( Sv  G  Hioc( R),  ^5  =  0,  2  <  0) 


d26U  d  ( 

SvdU,  ' 


1  d6U\ 
5(x)  dx  ) 

,  x  >  0 


OM) 


SU  =  0,  t  «  0 


DS0[v,  f]8v  = 


d6U 


dt 


x=0 
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so  that 


\\S0[v  +  6v,f ]  -  50[u,/]  -  DS0[v,f ]  •  <$6||z,2[o,:r] 

=  O  (|iH|tf»([0,T/2])) 

For  /  =  <5,  more  is  true:  for  constants  C_,  C+  >  0  depending  on  ||  log  v\\w[o,t/2] 
and  on  T, 


d8v 

1  ~  r 

c. 

dx 

< 

L2[0,T/2] 

\DSo[v,  <5]<56 

£2[0,T] 


<c+ 


dSv 


dx 


(3.5) 


lL2[0,T/2] 

Also,  for  /  =  H  (the  Heaviside  function),  there  exists  Co  depending  on 
||  logu||//i[0)r/2]  and  on  T  so  that 


\\DS0[v,  H]8v\\ L2[0tT]  <  Co||HU2[o,:r/2] 
Note  that  for  Sv  €  H\oc,  jDSo[v,  H]8v  €  H^oc  and 

OS„\v,i\  =  j(DSo\v,H] 


(3.6) 


Also 

So[v,f]  =  f*So[v,8] 

DSo[v,f]  =  f*DS0[v,6 } 

All  of  these  results  are  based  on  simple  local  energy  estimates;  with  the 
exception  of  (3.6)  they  are  stated  explicitly  in  Symes  (1986a).  The  Heaviside 
estimate  (3.6)  is  not  given  there,  but  the  proof  presents  no  novelties;  it  will  be 
given  in  any  case  in  the  forthcoming  monograph  on  layered  inverse  problems 
by  Santosa  and  the  author. 
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4  Ellipticity  for  the  One-dimensional 
Forward  Map 

We  have  assumed  (until  Section  7)  that  /  G  £'(R)  defines  an  elliptic  convo¬ 
lution  operator  of  order  zero.  Since  D5o[u,  6]  for  log  v  G  H\oc  is  invertible, 
((3.6),  (3.7)),  it  seems  clear  that  DSo[v,f]  =  f  *  5o[u,  6]  should  be  “elliptic” 
as  well.  The  purpose  of  this  section  is  to  formulate  and  prove  a  precise  result 
along  these  lines,  keeping  track  of  the  dependence  of  various  constants  on 
the  TT-norm  of  v,  the  time-  and  depth-intervals  used,  etc.,  etc.  The  result 
is  a  Garding-type  estimate  for  DSo ,  which  requires  that  DSo  be  given  on 
[0, 7\],  T\  >  To,  to  estimate  8v  on  [0,  To].  It  is  clear  that  a  little  “extra”  data 
is  required,  since  the  support  of  /  is  not  assumed  to  be  a  point.  In  fact, 
the  principal  constant  intervening  in  the  estimates  is  the  first  moment  of  /, 
which  measures  its  “spread”,  and  is  related  in  the  estimates  to  the  size  of 
the  necessary  “margin”  Ti  —  T0. 

Select  a  cut-off  function  ip  G  (^“(R)  with  ip(t)  =  0  for  t  >  T\  and  ip(t)  =  1 
for  t  <  T0,  and  define  the  smoothly  cut-off  version  of  So : 

S v,  :=  ip  So 


so  also 

DS^  :=  ipDSo 


In  the  following,  we  will  apply  the  estimates  of  the  previous  section  on 
various  t-intervals.  Since  the  constants  C7_,C+,  etc.  depend  on  the  length 
of  the  interval,  we  will  include  the  length  explicitly  in  the  notation,  for  the 
moment.  Thus  for  estimates  on  the  interval  [0,  T],  C_  becomes  C_[T],  etc. 
These  constants  depend  on  the  /T-norm  of  log  v  on  the  appropriate  intervals. 

Recall  that  “elliptic”,  applied  to  /,  means  the  inequalities  (2.1),  which 
we  recall  here  for  convenience:  for  <p  G  H1( R), 


d(j> 

> 

dx 

L2(R) 

K * 


supp /  C  R+ 


,  d<p 

f*  dx 


>  Kx 


d(p 


dx 


R) 


L=(R) 


—  K0  ||^||i2(R.) 


(4.1) 
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From  (3.5)  and  (3.6),  for  \ogv  €  KC(R),  6v  €  H}oc(R),  t;(x)  =  vQ  for  x  <  0: 

||D5v,[5,^]<55||l2(R)  <  ||i?5o[w,5]^t;||i(2[oIT1] 

_ II  aw  II  (4-2) 


while 


<  C+[T,1 


Il2[0,Ti/2] 


||jD5v,[n,^]^||L2(R)  >  |p50[tj,5]6u||L2[0iTo] 


2  f. 


L2[0,T0/2] 


Wds^v, h]6v\\L2(r+)  <  Co[r1]||^|u2[o,T1/2] 

DS^[v,f\Sv  =  ipf*DS0[v,8}8v 
=  f  *  DS^[v,8]8v 


where  for  e  we  have  the  standard  commutator  estimate 

||e||z,2(R)  <  \\^'\\L^rn)\\DSo[v,8}6v\\L2[0>Tl] 
Here  m)  denotes  the  measure  norm  of  the  distribution 

<i> • — *  (I f\M) 

i.e.  the  first  moment  of  /,  as  explained  in  Section  2. 
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Combine  this  estimate  with  (4.1)-(4.4)  to  get 

\\DS0[v,  f]Sv\\L2[0iTl]  >  \\DSj,[v,  f]6v\\L2(K) 

>  11/  *  §-tDSo[v ,  tf]H|£2(R)  -  llelk2(R.) 
>\\f*§-trl;DS0[v,H}6v\\L,{K) 

-\\f*^DS0[v,H}Sv\\LHR)-\\e\\L,m 

>  Ki\\§-tijjDSo[v,  H}8v\\L2(R)  -  K0\\'ipDSo[v,H]8v\\LnR) 

-K-\\^DS0[v,H]8v\\LHK)  -  ||e||£2(R) 

>  A'i||Z)5o[u,<5]<5C||£2[o,t0]  -  K0\\DSo[v,H]8v\\L2[0tTl] 

>  KlC- [To] ||  II L2[0,Tc./2l 

-Com]  (ko  +  K ■  (If ||t.(R))  ||<S||ti[0,T,/!l 

-m;||f||i.(R)C+mill^llL.|o,r,/!| 

The  upshot  of  all  of  this  is  the  inequality 

imsois./iHUv,,  >  ^.c-milf  ||tlIOTo/2) 

-Co[T,]  +  1( •  Iff  ||^(R))  ||HUv/*l 

Remark.  As  noted  in  Symes  (1986a),  in  general  the  constants  C+[T],  Co[T] 
increase  with  T,  while  C-[T\  decreases. 

Remark.  It  is  worth  noting  the  relation  of  the  various  bounds  involving  /  to 
its  Fourier  transform.  Indeed,  obviously 

K-  >  Il/|U~(R) 
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whereas  K0,  Kx  are  related  to  the  detailed  behaviour  of  the  Fourier  transform 
near  u>  =  0.  Suppose  that,  for  some  K*,  ft  >  0, 


|/M|  >  Km  for  \u>\  >  ft  (4.5) 

(i.e.  [ft,  oo)  constitutes  the  “passband”  of  /,  measured  with  tolerance  K „). 
Then  for  <f>  €  H'CR),  it  is  easy  to  see  that 


/* 


df 

dt 


>  K. 


lL2(R) 

which  gives  the  relations 


d$ 

dt 


LHR) 


ft||<£IU2(R) 


Kx  >  K„  K0  <  fttf. 


Note  also  the  effect  of  scaling:  if  /  satisfies  (4.6),  then 


m  ■■=  ;/({) 

satisfies  (4.6)  with  ft  =  ft/e,  while  =  emj.  Thus  Kx  —  0(  1),  I< 0  =  0( ^), 
and  m){  =  0(e )  as  e  — *■  0. 

It  is  clear  from  the  preceding  discussion  and  the  form  of  (4.5)  that,  for  any 
prescribed  Tx  >  T0,  ip  as  above,  and  any  “base”  source  wavelet  /,  a  scaled 
version  of  /  will  have  small  enough  first  moment  that 


m, 


dip 


dx 


L°°(  R) 


C+[T,]  <  i/C.C.ITol 


with  fixed  Kx  (independent  of  scaling).  Note  that  we  can  certainly  choose  ip 
so  that 

U¥ 


dx 


< 


L°°(  R) 


T1!  -To 


These  obserations  establish  the  non-vacuousness  of 


Theorem  2  Choose  Tx  >  T0  >  0,  and  K*  >  Kx  >  0.  Then  for  any  logu  € 
ff£c(R),  there  exists  T2,  Ti,  T0  >  0  depending  on  v  and  on  TX,T0,  I<*,K x, 
and  so  that  if  f  satisfies  (4-1)  with  some  K0  >  0  and 

mlj  <  m 


4.4 


then  for  8v  €  Hioc(R), 


\\DSo[vt  f]6v  ||l2[o,t,]  +  m)L 
I  d6v 

L2[0,T0/2] 


d8v 


>L, 


dx 


dx 

-Lo\\6v\\L2[otTl/2] 


L2[T0/2,T1/2] 
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5  The  Optimum  Coherency  Principle 


While  we  have  shown  that  the  linearized  one-dimensional  forward  map  is  el¬ 
liptic  under  the  circumstances  which  concern  us,  it  is  certainly  not  boundedly 
invertible  —  or  rather,  the  hypotheses  concerning  /  do  not  imply  any  uniform 
bound  on  the  inverse.  This  circumstance  is  widely  remarked  in  the  literature; 
for  a  sampler,  see  Santosa  and  Symes  (1986),  where  numerical  examples  are 
also  given  (see  especially  Chapters  6  and  7).  Recall,  however,  the  provenance 
of  the  one-dimensional  problem:  it  governs  the  propagation  of  a  plane  wave, 
the  surface  data  for  which  are  identical  to  the  Radon  transform  of  the  point 
source  surface  data  at  fixed  slowness  (or  angle).  The  possibility  remains  that 
the  collection  of  all  (precritical)  plane- wave  data  might  constrain  the  velocity 
estimate  more  severely  than  does  a  single  plane- wave  component. 


In  this  and  the  next  section  we  confirm  this  possibility.  Since  we  will 
consider  the  data  in  an  interval  of  slownesses  Pi  <  p  <  P?,  we  will  work  with  a 
suite  of  travel-time  velocity  models  {v(t,p)  :  0  <t  <T,  P\  <  p  <  P2}.  Recall 
from  Section  3  that  these  are  derived  from  velocity  profiles  c(z);  accordingly 
we  begin  with  the  question:  what  condition  must  v{t,p)  satisfy  in  order  that 
v  =  6[c]  for  some  c?  That  is,  we  seek  an  operator  whose  kernel  is  identical 
to  the  range  of  c  — »  u[c].  We  will  call  membership  in  the  null  space  of 
the  required  operator  (or  in  the  range  of  c  — ►  n[c])  the  coherency  condition , 
since  coherence  of  the  travel-time  velocities  v  is  then  forced  across  various 
values  of  p:  all  are  representations  of  the  same  mechanical  model,  in  different 
coordinate  systems. 


Denote  by  ((t,p)  the  inverse  of  the  two-way  travel  time  function  r(z,p), 


i.e. 


t  =  2 


dz 

v(z,p) 


Then  clearly 

rt/ 2 

C(t,p)=/  drv{T,p ) 

Jo 

We  will  now  regard  (  as  being  defined  by  this  formula,  hence  as  a  functional 
of  v.  So,  given  v  we  can  compute  £,  whether  v  —  v[c]  for  some  c,  or  not. 
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Thus  we  can  compute  the  quantity 


7  [5]  = 


1 


[(uoC-1)2 


+  P 


(Here  vo(  1(z,p)  =  6(C  1(ziP)iP))-  Referring  to  the  definitions  (Section  3), 
we  see  that  if  v  =  5[c],  then 

7  [5]  =  \ 


and  is  thus  independent  of  p :  that  is, 


d 


v  =  t)[c]  =>■  —  7[{>]  =  0 

op 


(5.1) 


This  last  condition  still  involves  the  travel-time  change  of  variables,  so  does 
not  define  a  sufficiently  regular  function  of  v  (recall  the  discussion  in  Section 
3).  Define  instead 


u3 


QPl  =  — 2 


dv 


7{u] 


°c 


d_ 

dp 

^(corxr2|  °e  -Pv3 
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dv 


~3 


dp  v  dp 

A  short  chain-rule  calculation  gives 

d 


=  \  +  (fllC  )°C^7  i~pv 


dt 


dp  )  v  Jo  dp 


dp 


dp 


which  relation  allows  us  to  view  Q[u]  as  a  functional  of  v.  Clearly,  from  (5.1) 

v  =  u[c]  =>  Q[v\  =  0 


It  will  be  important  to  define  the  coherency  condition  in  such  a  way  as  to 
have  the  largest  possible  domain  contained  in  H^( R  x  [Pi,P2])  (which  will 
be  the  natural  domain  for  the  forward  map,  defined  below).  An  obvious 
choice  is  H1,  but  Q  is  not  continuous  in  that  topology.  As  it  happens,  we 
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can  replace  Q  with  another  operator  having  the  same  kernel,  but  which  is 
continuous  (even  C°° )  in  the  H 1  sense. 

Suppose  temporarily  that  v  =  u[c]  for  some  c.  Then 

r*/2  dv\  dv  ~ 


Iq[v\  = 

V  V  dp  V1  \J 0 


d  (2  [t/idv  n  t'r* 

-  — -2  p  v 

v  Jo  op  Jo 


dp)  dt 
</ 2 


—  pv 


dt 

=  0 


dp 


Thus 


The  map 


_  .  2  ['I*  dv  n  (W  _2 

v  =  ti  c  =>  ~  ^-  =  2p  /  v 

v  Jo  dp  Jo 


(5-3) 


«P]:=|-2p 


r‘n  -j)  96  -3 

o  ”  )m-pv 


therefore  also  satisfies 

v  —  6[c]  Q[v]  =  0  (5.4) 

On  the  other  hand,  for  any  T  >  0,  R  =  [0,  T]  x  [P^Pj],  Q  obviously  defines 
a  C2-map 

Q  :  H\R)  ->  L2{R). 

The  following  converse  to  (5.4)  shows  that  Q  =  0  is  an  adequate  coherency 
condition: 

Lemma  1  Suppose  that  log  v  £  H1(R)  and  Q[v\  =  0.  Then  for  some  Z  >  0 
and  some  c  £  //X[0,Z], 

V  =  v[c]|fl 

Proof:  Integrate  in  t: 

0/ 2 


rtf* 

0  =  Jo  dt'Q[v](t' ,  p) 

rt/2  flf,  rt/ 2 

=  jf  dt'—(t',p)-2pv(t,p)J^  dt'v2(t',p) 


<Hlu> 

dt 


(t',p)  -  pv3(t\p) 
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after  integration  by  parts,  so  we  once  again  recover  the  relation 

pt/2 


v  Jo  op  Jo 


It  follows  immediately  that  Q[v\  =  0  as  well,  which  is  equivalent  to 

d 


dp 


7[u]  =  0 


Set 


R<  =  {(^,p)  :  Pi  <  P  <  Pi,  o  <  z  <  C(r,p)} 
v  =  v  o  (_1 

It  is  easily  checked  that  logu  6  Hl(R^).  On  the  other  hand  with 

\  -1/2 


c(z)  :  = 


1 


+  P 


\v(z,P)2  / 

=  7[^]-1/2(2iP) 

we  have  c  €  Hl[ 0,  Z],  Z  -  supPi<p<p2  ((T,p),  and 

t  \  c(*) 

v(z,p)  = 


yjl  —  C2(z)p2 
C\z,P)  =  2  /  - 

Jo  V 


whence  £_1  =  r,  and  the  conclusion  follows. 

q.e.d. 

We  now  turn  to  the  definition  of  the  multi-plane-wave  forward  map.  For 
simplicity,  define  seismograms  on  the  data  rectangle 


R\  —  [0,  T\]  x  [Pi,  P2 
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Choose  Co,  Ci  >0  with  cqP2,c\P2  <  1,  T2  >  T0,  and  i*  >  0  and  set 

E  =  {5e^11oc(Rx[A,P2]): 

Co 


v{x,p)  = 


v(x,p)  = 


\/l  -  cop2  ’ 

Cl 


a:  <  0, 


\j  1  -CiP2 

rr>|T2 

II  l°g^ll//l([0,5T2]x[Pa,P2])  < 

and  its  “tangent  space” 

E'=  {«efftc(Rx[P„ft]): 

<$u(;r,p)  =  0,  i  <  0  or  i  >  |T2} 

Only  a  finite  interval  in  t  is  needed  for  the  arguments  which  follow.  With 
T2  >  Ti  to  be  determined  below,  set 

R2  =  [0,  T2/2]  x  [Pi,P2] 

Identify  elements  of  ^  and  ^  with  their  restrictions  to  R2,  and  topologize 
Z and  z  as  subsets  of  Hl(R2). 

With  these  conventions,  define  the  forward  map 

S  :  E  —  L2{Rx) 

by 

5[u](t,p)  =  So[u(-,p),/](0 

with  So  as  in  Section  4.  From  the  results  stated  there  and  in  Section  3,  it 
follows  that  S  is  of  class  C2,  with  derivatives  bounded  in  terms  of  Co,Xi,/, 
and  (.*. 

The  derivative  PS[fi]  also  obeys  an  “elliptic”  estimate.  To  state  this,  set 
Ro  —  [0,To/2]  x  [Pi,P2],  R\  =  [0, Ti/2]  x  [Pi,P2]. 

Then  from  Theorem  2  follows 
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Theorem  3  Given  P2  >  Px  >  0,  Tx  >  T0  >  0,  K*  >  Kx  >  0,  and  t  >  0, 
there  exist  fh,  L0,  L\,  L2,  L\  >  0,  so  that  for  K0  >  0  and  f  £  £'(R)  satisfying 
(f.l)  and 

mlf  <  m 

so  that  for  v  £  8v  £  ]£ 


||D5[u]^5||L2(Ri)  +  m)L2 
I  d8v 


d6v 


>LX 


dx 


dx 
-  L0\\8v\ 


L2(Ro) 


Moreover,  for  each  p  £  [Pi,  P2], 


\l2(R1\R0) 

L2(Ri) 


||£>5[v]^u(-,p)||£,2[0,r1]  +  m)L2 
I  d8v 


dSv 


dx 


(•>P) 


>LX 


dx 


■(•iP) 


L2{  0,To/2) 


L2  [To,/2,Ti/2] 

Po||^(-,p)||£,2[o,tl/2] 


In  view  of  Lemma  1  and  the  obvious  relation 

5  [5[c]]  =  S[c] 

we  can  now  state  a  version  of  the  inverse  problem  closely  related  to  the 
least-squares  problem  (2.2),  as 

minimize  ||5[u]  -  D\\\2(Ri)  over  u  €  £  .  , 

subject  to  Q[v]  =  0 

In  fact,  a  solution  to  this  problem  clearly  yields  a  solution  to  (2.2)  on  a  suit¬ 
able  depth  interval.  On  the  other  hand,  this  problem  would  appear  to  have 
the  advantage  of  regularity:  both  the  objective  and  constraint  functions  are 
of  class  C 2.  Moreover,  it  is  possible  to  show  that,  under  the  circumstances 
described  in  Theorem  4  below,  the  Hessian  operator  of  the  objective  func¬ 
tion  is  positive  definite  on  the  null  space  of  the  linearized  constraints,  at  a 
consistent  data  set,  i.e.  when 


D  =  S[c*] 
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for  suitable  c*. 

Unfortunately  these  properties  are  insufficient  to  yield  a  stable-local- 
existence  result.  To  motivate  the  next  step  in  the  development,  we  digress, 
with  a  brief  review  of  Lagrangian  theory  for  constrained  optimization,  and  a 
simple  but  closely  related  example. 

Suppose  that  X,  Y  are  Hilbert  spaces,  f  :  U  — <•  R,g  :  U  —>  Y  smooth  on 
an  open  set  U  C  X.  As  is  well-known  (Luenberger,  1973),  a  local  solution  of 
the  constrained  optimization  problem 

minimizexeu  f(x ) 
subject  to  g(x)  =  0 

is  a  critical  point  of  the  Lagrangian 

C(x,\)  =  f(x)  +  (A  ,g(x)) 

We  wish  to  apply  the  implicit  function  theorem,  to  assure  that  a  solution  is 
stable  against  (C2— )  perturbations  in  /.  The  conditions  necessary  to  apply 
the  implicit  function  theorem  to  the  critical-point  problem 

grad  X,\£(x,  A)  =  0 

are  also  sufficient  to  ensure  the  convergence  of  Newton’s  method  (and,  gener¬ 
ally,  of  its  computationally  efficient  quasi-Newton  relatives).  These  amount 
to 


Dg(x)6x  —  0 

(i)  =>  ( 6x ,  [Hess  f(x )  +  (A,  Hess  g(x))],6x) 

>  -MMi2 

(ii)  ||T><7(z)*A||  >  4||A|| 

for  constants  l\,li  >  0.  The  so  called  “second-order  sufficiency”  condition 
(i)  may  be  verified  for  the  problem  (5.3),  as  was  mentioned  above.  The  so- 
called  “constraint  qualification,”  however,  fails,  in  the  manner  illustrated  by 
the  following  simple  example,  which  captures  the  main  features  of  (5.5). 
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Let  X  =  {u  e  H1( R )  :  /  fRu  =  0},  Y  =  L2(R),  R  =  [0,  l]2  the  unit 
square  in  R2.  Set  for  u  E  X,  v  E  Y  given, 


/(«) 


du 

dxi 


2 


g{y)  = 


du 

dx2 


and  let  us  suppose  that  v  =  v(xi),  so  that  the  problem 


minimizeu€x  grad  f{u) 

subject  to  g(u)  =  0 


has  the  unique,  zero-residual  solution 

[X 1 

u(x i,  x2)  =  /  dx  u(x)  +  const. 

Jo 

The  second-order  sufficiency  condition  (i)  is  an  immediate  consequence  of  a 
form  of  Poincare’s  inequality  (Necas,  1967).  The  adjoint  Dg*(u )  is  given  by 

Dg-(u)  =  -MX 

where  Af  is  the  solution  operator  of  the  Neumann  problem:  Afb  =  w,  where 


(. I  —  A )w  =  b  in  R 

^  =  0  on  dR 

dn 

Set  A k(x)  =  cos  ki7rxi  cos  /c? 7rx2  for  k  G  Z2.  Then 


||D5*(u)Ai||wi(H)  = 


2  k2 

+  k\  +  k2 


which  can  be  made  as  small  as  one  likes  by  taking  ki  large.  Thus  the  con¬ 
straint  qualification  (ii)  fails. 

For  similar  but  nonlinear  problems  such  as  (5.5),  the  nonzero  singular 
values  of  Dg  are  proportional  to  the  instantaneous  radii  of  curvature  of  the 
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constraint  set.  The  failure  of  the  constraint  qualification  could  conceivably 
be  associated  with  the  presence  of  arbitrarily  small  radii  of  curvature  — 
indeed,  this  must  be  the  case  for  uniformly  nonlinear  problems  like  (5.5).  In 
essence,  the  constraint  set  for  such  a  problem  has  a  cusp  at  every  point!  No 
reasonable  stability  properties  can  be  expected  for  the  solutions  of  such  a 
problem. 

On  the  other  hand,  the  Hessian  operator  of  the  “penalized”  cost  functional 


f(u)  +  a2\\g(u)\\lHR) 

is  positive-definite  for  any  a  >  0  —  again,  this  is  simply  Poincare’s  inequality. 
This  observation  motivates  the  following  construction. 

For  choices  of  time  and  slowness  intervals  T2>  T\>  To  >  0,  P2  >  P\  >  0, 
“data  set”  D  6  L2(R\)  and  “tuning”  parameters  er,  A  >  0,  define  for  v  6  J2 


4.a[«]  =  ;  { p[«]  -  D"2 


2  Ul 
+A2 


+  <?2  \\Qt\\lHR. 


dv 


dx 


L2(Ri) 


L*(R2\R o). 


(5.6) 


The  first  two  summands  in  the  definition  of  Jaj\  are  motivated  by  analogy 
with  the  “Dirichlet”  problem  discussed  above.  The  form  of  the  “elliptic” 
estimate  ( Theorem  3 )  and  the  need,  explained  in  the  next  section,  to  choose 
T2  >  To  motivate  the  last  term.  For  example  the  elliptic  estimate  gives  a 
bound  on  6v  only  on  the  shorter  interval  [0,  To],  so  Sv  must  be  bounded  a 
priori  for  t  >  T0. 

We  shall  call  the  three  terms  on  the  r.h.s.  of  (5.6)  “data,”  “coherency,” 
and  “extension”  terms.  Minimization  of  (5.6)  is  the  “coherency  optimization 
problem.” 
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6  Proof  of  the  Main  Theorem 


We  begin  this  section  with  the  proof  that  the  Hessian  operator  of  Ja<\  as 
defined  in  Section  5  is  positive-definite  at  a  sufficiently  rough,  coherent  v. 
The  main  idea  is  that  the  Hessian  quadratic  form  consists  of  the  “data”  term 
dominating  the  high-frequencies  in  Sv,  and  a  “coherency”  term  essentially 
consisting  of  the  product  of  the  indefinite  integral  (in  x)  of  Sv  and  a  derivative 
of  v.  This  latter  term  is  thus  the  product  of  a  smooth  factor  (depending  on 
Sv)  and  a  rough  factor  (derivative  of  u).  When  the  rough  factor  is  rough 
enough,  uniformly  on  the  length  scale  of  significant  change  in  the  smooth 
factor,  then  the  product  dominates  the  smooth  factor.  Lemma  3  below  makes 
this  heuristic  reasoning  precise,  and  establishes  a  mathematical  meaning  for 
“sufficiently  rough.”  The  “smooth  factor”  discussed  here  is  the  indefinite  ( t -) 
integral  of  Sv\  the  estimate  for  it,  together  with  the  elliptic  estimate  from 
Section  5  and  an  interpolation  argument,  given  a  bound  on  Sv  in  terms  of 
the  “data,”  “coherency,”  and  “extension”  terms  of  the  Hessian  of  Ja,\. 

Recall  from  the  preceding  sections  the  geometry  of  the  coherency  opti¬ 
mization  problem: 


0  <  T0  <  Ti  <  T2 
0  <  Pi  <P2 


time  limits 
slowness  limits 


Rv  -  [0, Tv]  x  [Pi,P2]  1  n  1  « 

Ry  =  [0,r„/2]  x  [Pi,  Pa]  J 

(data,  model  rectangles); 


the  function  spaces  involved  in  its  setting: 


J2  =  l  v  €  H\R2)  :  v{0,p)  =  —7=^===,v{T2,p)  = 


0“  Co  P2 


0  ~  ciP2 


£  =  {*«€  H\Ra)  :  «(0,p)  =  0  =  Sv(T2,p)} 
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and  the  maps: 


5:  £  — > 

ds-.  £*£— >iW 

Q,  £  — .  £2(ij2) 

0<3:  £  x  £  — *  I?(h) 

It  will  be  essential  in  the  arguments  given  below  that  T2  be  related  appro¬ 
priately  to  T0,  as  follows.  For  v  6  the  L°°-bound  on  u  implies  that 

CCTcA)  =  suppj<p<.p2  C(r0,p) 

<  |Toer  :=  Z 

whence 

r(Z,Pi)  =  supPl <p<p2  r(Z,p) 

<  2Zer  =  T0e2** 

Set  T2  =  T0e2*\  Then  it  follows  that  for  any  pi,p2  G  [Pi,P2], 

/r(C(?o,Pi),P2)  <  T2 

Define 

Rz  =  [0,Z]  x  [Pi,P2] 

Then  for  G  ^^(R),  i)6E 

IIHItfi(Po)  -  C'll<^OTIU1(Kz) 

<  C"||6u||j/i(h2) 

and  similarly  for  L2-norms. 

Here  we  have  introduced  the  habit,  to  which  we  shall  uniformly  adhere, 
of  denoting  by  C,  C', . . .  constants  which  may  be  chosen  uniformly  over  £. 
We  shall  make  no  attempt  to  identify  optimum  choices  of  such  constants,  so 
that  the  end  result  of  our  argument  is  only  qualitative  in  nature. 

Recall  from  Section  2  that  the  determination  of  c  from  band-limited  data 
requires  that  c  be  “rough”  in  some  sense.  For  convenience,  we  restate  the 
basic  “roughness”  criterion,  phrased  in  terms  of  an  (arbitrary)  Dirac  kernel 

ht{z)  =  ^i(|) 
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where  hx  satisfies  the  usual  requirements: 


hi  €  C£°(R),  hi{0)  >0,  hi  >  0  , 

/A'  =  1- 

Then  {eA'}  is  bounded  in  L1,  and  defines  a  family  of  “low-cut”  filters,  i.e. 
convolution  with  eh[  suppresses  Fourier  components  at  frequencies  <  O(^). 

For  c  G  Hioc( R),  t  >  0,  A  >  0  define 

r[c](z,e,A)  =  \etie*c'\2 

r[c](z,A)  =  ^ 

and  for  any  Z0  >  0, 

r,[c](Z0,e,  A)  =  o<mf^  r[c](z,  e,  A) 
r*[c](Z0,e, A)  =  sup  r[c](z,e,A) 

0<z<Z0 

r*[c](Z0,A)  =  sup  f[c](z,A) 

0<z<Z0 

The  main  step  in  the  proof  of  Theorem  1  is  embodied  in: 

Theorem  4  There  exist  constants  fh,  Mi,  M2,  e^,  and  Ao  >  0  depending  on 
To,Ti,T2,t,K0,Ki,  and  K*  so  that  if 

(i)  /  G  S'  satisfies  (f.l)  and  mlf  <m\ 

(ii)  v  G  ^  is  consistent  with  D  G  L2(Ri),i.e. 

S[5]  =  D 

(iii)  v  G  '^2  coherent,  v  =  u[c]  for  c  G  H\oc{ R)  ; 
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(iv) 


for  some  t ,  A,  Zq  >  0  with  t  <  e0,  A  <  Aq,  and 


t(Z0,  Pi)  =  2  f0z°  dzyf^pn*  >  To 
the  following  inequalities  hold: 
r,[c](Z0,c,  A)  >  Mi 

M2rm[c](Z0,  e,  A)  >  max(r*[c](Z0,  e,  A),  f[c](Z0,  A) 

Then  there  exists  p  >  0  depending  on  T0,  Ti,  T2,£*,  K0,  K\,  a,  and  A  so 
that  for  6v  £  JP  , 

(Sv,  Hess  >  /i||^u||  fj'(Ri) 

Before  giving  the  proof  of  Theorem  f,  we  digress  to  demonstrate  the 
meaning  of  condition  (iv)  in  the  theorem,  and  show  that  the  set  of  v  satisfying 
it  is  non-empty. 

Let  Co,6ci  6  C°°(R)  fl  L°°(R),  so  that 

rz+1 

0  <  r,  <  /  \h[  *  6c'A2  <  r*,  —  oo  <  z  <  oo 

Jz- 1 

and  take  r  =  sup26R  //+,1  |6c'x|2.  Clearly  we  can  choose  6c\  so  that  r*  >  r»,  f 
achieve  any  prescribed  values.  On  the  other  hand  set 

6ce(z)  =  2eSci(- ) 

ce(z)  =  co(z)  +  6c((z) 

for  0  <  z  <  Zo,  cut  off  to  constants  elsewhere.  Then  {logce}£<£0  is  bounded 
in  fLx[0,  Z0],  for  suitable  e0,  but 

r.  -  0(e)  <  1  f+<  | eh',  *  c(| 2  <  r"  +  0(e) 

Z6  J z—t 

i£‘|c;i  =  r-+o(e) 
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Thus  with  the  choice  A  =  2 e,  the  quantities 

r*[c£](Z0,e,2e),  r[c£](Z0,2e) 

stand  in  more-or  less  fixed  proportions  throughout  the  set  {c£}.  We  conclude 
that,  whatever  the  values  of  the  constants  M\,  M2,  e,  A  specified  in  Theorem 
4,  the  set  of  travel-time  velocities  satisfying  condition  (iv)  is  non-empty  — 
simply  take  v  =  u[c£]  for  sufficiently  small  e,  and  a  suitable  choice  of  Co,<$Ci. 

Also  the  meaning  of  condition  (iv)  is  clear  from  this  construction.  As 
e  — »  0,  the  perturbation  Sct  becomes  smaller,  but  its  derivative  has  uniformly 
bounded  (above  and  below)  mean-square  over  intervals  of  length  e,  and  this 
even  after  convolution  with  the  oscillatory  kernel  eh't.  Thus  c£  has  significant 
oscillation  everywhere  on  the  scale  e,  i.e.,  c£  is  “uniformly  rough.” 

In  the  estimates  which  follow,  we  will  write  for  convenience 

6Q  =  DQ[v]6v 
8S  = 

Also,  for  any  function  u  of  (t,p)  or  ( z,p )  we  shall  denote  by 

Ju 


the  function 


(t,p)*->  [  dt'u(t',p)  or  (z,p)  ■->  [  dz'u{z',p). 
Jo  Jo 

To  begin  the  proof  of  Theorem  4 ,  note  that 

(Si 5,  Hess  J< Tia[u]  •  Sv) 

—  ll^lli,2(R1)  +  a2\\^Q\\2L*(R2) 

+A2 


dSv 


dX  II L*(A2\Ho) 

because  of  the  consistency  and  coherency  assumptions. 


(6.1) 
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The  second  term  in  (6.1)  is  most  interesting.  It  follows  immediately  from 
the  definition  of  Q  that 


6Q 


-2p(/62)lr_4p(/eM)i f-3^40 


o 


where  we  have  written  for  convenience  bv  =  bv  o  r.  Note  that  bv  is  not  the 
perturbation  in  v  resulting  from  a  perturbation  in  c. 

Choose  a  test  kernel  g  €E  C^°(R2)  with  ||<7||n(R)  =  1  and  supp#  C  [0,oo). 
Then 


g  *  ( bQ  o  r) 


d  , 

—g  *  bv 

Op 


?(/ 


dz  bv 


dv2 

9*  a7  +  £l 


—3 pg  *  ( v2bv ) 


The  error  term  Ex  is  the  commutator  of  a  multiplication  operator  and  con¬ 
volution  with  g: 


E\—p 


dz  bv)  g  * 


dv2 

dz 


-  g* 


for  which  a  standard  estimate  gives 

(We  have  used  the  notation 

mg=  JRdz  zk\d(z)\ 

for  the  moments  of  |^|  as  explained  before.) 
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The  next  goal  is  an  L 2  estimate  for  Kg  on  the  domain  Rz  =  [0,  Z]  x 
[Z3! ,  P2 ] .  Recall  that  Z  is  chosen  so  that 


T0<t(Z,P2)<t(Z,P1)<T2 


First  note  that  for  its  indefinite  p-integral, 

/•P2  rvi  P2  m 

/  dp  Kg  =  g  *  ( 6Q  o  t)  —  g  *  8v  —  dp  E\ 

dpi  dpi  p=Pl  dpi 


Since  composition  with  r  and  the  indefinite  p-integral,  and  z-convolution 
with  g  are  all  bounded  operators  on  L2oc{ R  x  [pi,P2]), 


CPI 

/  dpKg 

JP1 


L2[0,Z] 


<  C  {|| g  *  ^(•>Pi)IIl2[0,z]  +  Ik  *  ^(■’P2)||L2[0,z] 


+  +  mg 


88  v 


dz 


L2(RZ). 


Now  assume  that 


I9  = 


0 


and  choose  0  €  Co°(R)  with  V’  =  1  on  [0,  Z].  For  p  €  [Pi,P2],  denote  by  8v 
the  extension  8v  by  a  constant  for  z  >  Z.  Then  (since  suppp  C  R+) 

||0*M'»p)IIl*M]  <  \\9  *i>Sv(-,p)\\l-2[R] 


=  J  dk\g(k)\2x/>8v(k)Y 


1 


\Z2tt  J 
<  (sup 


idkm 


kWl\ 
*  1*1  ) 


1*1 2 

2 


\iki^8v{k)\‘ 


-^(xp8v{-,p)) 


LH  R) 


88  v 


dz 


(■>P) 


L2([0,Z]) 


since  8v  is  constant  for  z  >  Z  and  £w(0,p)  =  0. 
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Define 


l»«l 

£-  =  TTT 


From  the  bound 


dSv 

dSv  . 

<  c 

L2[0,2(] 

ax 

l2[o,x2] 


valid  for  66  G  we  get  for  any  P\  <  Pi  <  Pi  <  Pi, 


rP2 

2  f 

<966 

/  dp  Kg 

ax 

Jpl 

L2[0,Z]  [ 

£2[o,r2/2] 


dSv .  . 

+  I  ~^r~('iP2) 


dx 


L2[0,X2/2] 


+  ||*Q|| 


£2(fl2) 


+  ml 


<966 


dx 


x2(fl2). 


(6.2) 


Next  we  estimate  the  p-derivative  of  Kg: 


The  first  term  is  clearly  dominated  in  L2[ 0,  Z ]  for  each  p  by  a  (^-dependent) 
multiple  of 

!l<5t>(*,p)||^2[0,  i  r2] 

From  the  definition  of  6Q, 

/  ^  =  Jdz  |«3  o  r  -  ip  (/  &  «„)  ^  -  3 pjv'Sv} 
which  is  bounded  by  a  (^-dependent)  multiple  of 

P<3(,)P)lk2[o,T2]  +  IIM’>P)IIl2[0,±T2] 
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Finally  we  note  that 

c 

7)  —  - 

y/1  —  C2p2 

for  some  c  with 

||c||tfi[o,Z]  <  C 

so  that  for  any  p  €  [Pi,  P2], 

££(;p)  <  C 

dzdp  l2[o,z] 

The  upshot  is  the  estimate 

BK 

<e{||«<?(-.P)||I1|0,iTll 

UP  L2[0,Z\ 

+  ||<5u(-,p)||i2[oiir2]} 

(6.3) 

Integrating  in  p,  we  also  get 

r)K 

<  C’dl^ll^)  +  IH|l2(k2)) 

Up  L2(RZ) 

(6.4) 

Now  we  combine  (6.2)  and  (6.4)  to  estimate  Kg  via  a  simple  interpolation 
inequality,  which  is  a  special  case  of  Gilbarg  and  Trudinger  (1983)  Theorem 
7.28: 

For  u  €  H 2  [a,  6] ,  a  >  0 : 

ll“1ll2[a,&]  <  allU,,|lL2[a,i.]  +  ~IMIl,2[a,&] 

(6.5) 

where  C  =  C(\b  —  a\). 

Apply  (6.5)  to  u(p)  =  /  dp  Kg(z,p)  for  arbitrary  po  6  [Pi,P2],  integrate 

Jpo 

the  result  in  z,  and  use  (6.2)  and  (6.4)  to  obtain 
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H*X(«,>  S  S  {e?  ([ A  -  PA  II If  (•, PotllLto.  jiy  +  Ilf  11(0 


4.  (m1)2  —  2  + 

+  v  dx  l2(h2)  + 


11*%,*,)}  +  “C'{||«?||’1()W  +  ||*  UIIl2(H2)} 


With  the  choice 


a*  =  max(e3,  mj), 
a.  =  min(£3,m*) 

this  becomes  after  integration  in  po  from  P\  to  P2 
all LHRZ)  ^  {|| 

+  C'(«' +  ±)\\SQ\\lHfl2) 


||  K,  112 


9*  L*(A2)  +  W6v\\lHRi) 


R2)} 


(6.6) 


The  next  step  is  to  show  that  Kg  actually  dominates  the  indefinite  integral 
of  6v.  It  is  at  this  point  that  some  constraints  on  v  (hence  on  c),  other  than 
coherency  and  membership  in  J2  ,  become  necessary.  Recall  that 

K„  =  -^p(JdzSv)g*^ 

Thus  Kg  is  the  product  of  a  relatively  smooth  factor  (the  indefinite  integral) 
and  a  relatively  rough  factor  (the  derivative).  Clearly,  some  estimate  of  the 
smooth  factor  must  be  possible,  provided  that  the  rough  factor  is  sufficiently 
uniformly  rough.  The  following  simple  lemma  gives  a  crude  criterion  of  this 
type: 


Lemma  2  Suppose  that  tt,<£  G  C°°(R).  Set  for  A  >  0,  a  <  b 

1  rx+  y 

r(l’A)=A  L*  M 


r,(A)  =  inf  r(a:,A),  r*(A)  =  sup  r(x,A) 

*€[a,6]  *€[*,«>] 

Then  for  any  A  >  0,  (L2 -norms) : 

II*<Xmi  >  ^11*112™  -  T<r-<A>  +  -••(A))A!||*'rt,M1 
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Proof:  Set 


1  fx+  f 

**m-aJU* 

Then  the  Cauchy-Schwarz  inequality  gives 

4 


l*(»)  -  *a(x)|  <  | a!  i*'i 


for  a:  —  y  <  y  <  x  +  y.  Thus 


rx+  y 

o  16  /':r+f'  .,0 

/  A  i<h2  > 

J  X—  ~2 

i$a(x)2-ta  j[_^  m2 

rx+ 4 


2  ul2 


A 
'  2 

16  rx+f 

>  A|$A(z)|2r.(A)  -  — A2r*(A)  |*'| 


Similarly 


Thus 


ci  i*“i2  *  ^  p |$|2  -  y  (-<  a> + -w  c; |$,|! 

Now  sum  both  sides  over  x  =  (k  +  |)A,  k  =  0, . . .  [y^2]  to  obtain  the  result, 
q.e.d. 

We  shall  apply  Lemma  2  to  Kg,  with  the  identifications 

$  ~  ~\p S 8v (',p) 

dv 2 

u  „  g  *—{■  ,p) 


Note  that 


so  that 


9  * 


^  =  (l-cV)-!cS.|  +  E2 
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We  will  assume  that  the  length  scale  A  is  chosen  so  that  supp  <7  C  [y,  y]. 
Then  a  slight  refinement  of  the  standard  error  estimate  gives 


i  Cl  ^  -  CAM(A)a„(A) 


where  we  have  written,  for  any  compactly  supported  measure  h ,  and  Z0  >  0 
to  be  determined: 


rl(  A)  = 

1 

r*+f 

,  dc 

2 

=  sup 
2€[0,Zo] 

A. 

u 

h  *  — 
dz 

1 

fz+  f 

,  dc 

2 

rM(A)  = 

=  inf 

26[0,Zq] 

A. 

u 

fa  _ 

dz 

1 

/2+  f 

flM* 

dc 

«/i(A)  = 

=  sup 

z£[Q,Zq] 

A. 

u 

dz 

For  various  ^-dependent  constants  C1-C4,  any  2  £  [0,  Z0\, 

c,ra,.(A)  -  c2aV;(ak(a) 

dv2\‘ 


<  i  r1 

-  A  Jz-t 


g  * 


dz 


<  C3r*(A)  +  C4A2r*(A)aa(A) 

Accordingly  Lemma  2  implies,  so  long  as  Zq  <  Z,  for  po  E  [A,  P2] 

2 

L2[o,z0] 

-C'(r,,(A)  +  r*(A)  +  rg*(A)as(A))A2||<5u(-,p)||22[0,Zo] 


l|Jf»(-.P)lli»[o,z)  £  Cp2r,,,(  A)  J  dzSv(-,p) 


(6.7) 


These  estimates  have  no  force  unless  the  quantities  r3i*( A)  and 

r*(A)  =  (r,,„(  A)  +  r*(A)  +  rg*(A)a3(A)) 

are  comparable.  These  quantities  depend  on  the  parameters  A  and  Z0  and 
on  the  kernel  g  (all  of  which  are  still  to  be  chosen)  —  and,  of  course,  on  the 
velocity  profile  c. 
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To  further  manipulate  the  norms  of  Sv,  8v,  we  require  an  estimate  on  the 
p-derivative:  after  some  algebra, 

dr  12 

—  dz8v(-,p ) 

dp  J  Il2[o,z0] 


dp 


=  2  ( J  dz  8v(-,p),  J  dz  ^>(7  °  r  +  \p  (/  dz  Sv'j  )/^[o,^b] 


<  C 


[  dz  8v(-,p)  +  C'  || SC  o  r(-,p) \\2LHo  z  ] 

J  L2[0,Zo] 


Here  we  have  employed  an  estimate  on  norms  of  products: 
Lemma  3  For  f  G  Hl[a,b\,  g  G  L2[a,b\,  define 

f  fg  =  h(z ),  a  <  z  <b 

J  a 


Then 


lo  <  C\\fh  1 9\\ 

\\Ja  Ho 

for  a  constant  C  depending  on  a,  b. 


Proof:  Since 


we  have 


h{z)  =  f{z)  f  9~  f  dz' f'(z')  l  g 

J  a  J  a  Ja 


o  <  \\J\\L°° 


WjsWo+^ldz  {J’d*'fV)JoZ  9 J 

<  Cll/ll,  II J  s\\o  + fid*  [fi  i*'W  (*')?]  fid z'  \fi  s 


whence  the  required  inequality  follows. 

Thus  Gronwall’s  inequality  gives 

\  f  N  ^ 

/  dz  8v(-,p) 


L2[o,z0] 


-  Jdz8vfi,p0) 


2 

L?[0,Z0] 


<  C\\8Q\\lHk2) 


q.e.d. 
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for  any  po  €  [Pi,  A]-  Thus 

/  dp  p  \\  dz  Sv(-,p) 

JPi  \\J  z,2[o,z0] 

so  that  (6.7)  implies,  after  integrating  the  preceding  inequality  in  po  from  Pi 

to  P2, 

C\\Kg\\2L2(Rz) 

^  C',r5,*(Z^)ll  MIl2([0,Z0]x[Pi,P2]) 

~C 

+  r*(A)A2|IHl£2([o,z0]x[pi,P2])} 


Now  concantenate  the  above  inquality  with  (6.6)  to  get 


Im  IIl2([o,z0]x[p1,p2]) 

-  C"  {||^<5Hi2(fl2)  +  r*(^)^2||^|li2([0,Zo]x[Pi,P2])} 


<  + c'(a*  + 

Cx»  Cx+  v  2 

Next  recall  that  the  L2-norms  of  8v  and  8v  are  related  by 


ll^uIU2(pz)  <  C|I^IIl2(p2) 

as  noted  at  the  beginning  of  the  section,  which  allows  us  to  simplify  the 
above  inequality  to: 


rs,*(A) II  /  dz  HIl2([o,z0]  X[Pl,P2]) 

<  C  (^  +  <(A)A2)  ||«S||2h,  (flj) 

+  C' (l  +  a' +  £)  ||«7||21(fi])  (6.8) 
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It  is  convenient  at  this  point  to  put  the  left-hand  side  in  terms  of  8v  also,  by 
means  of  Lemma  3.  Since 

rz  fr(z,p) 

/  dz'8v(z',p)  =  /  dx  v(x,p)8v(x,p) 

Jo  Jo 

we  can  apply  Lemma  3  with  /  =  (5(-,p))-1,  g  =  v(-,p)8v(-,p),  to  get 


J  6«(*,Po) 


L2[0,r(Z0,po)] 


<  c 


J  dv(-,p0) 


L2[0,Zo] 


whence  (6.8)  implies,  after  integration  in  po  from  Pi  to  P2, 

r5,*(A)  ||/  *»|| 


<  C  (^2  +  rt*(A)A2)  II^IIJ 


+c'{  i  +  <**  +  ~)II^<5IIl2(h2) 

Here  A3  is  the  region 

{(x,p)  :  Pi  <  p  <  P2,  0  <x<t(Z0,p)} 

At  this  point,  we  determine  Zq :  set 

Zo  =  <fa,P,)=JnUc(\T0,I>] 

Thus 

A3  C  Ro 

The  interpolation  inequality  (6.5)  and  (6.9)  now  yield 


ll^lli2^)  — 


P  + 


C  ( (a*)2 


^rs,*(A)  V  a* 


+  r*(A)A‘ 


J  J 


||65 


(6.9) 


nil  2 


\W(Ri) 


+ 


C'(l  +  a*  +  (1/a*))  ^||2 


Prg,.{  A) 


■¥Q\\ 


lhr2) 


(6.10) 
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It  follows  immediately  from  the  Dirichlet  condition  at  x  =  T2  in  the 
definition  of  ^  that 


It  is  only  slightly  more  difficult  to  estimate 


88  v 


8x 


L2(R2\Ro) 


II^IIl2(Hi\a3)  —  c 


In  fact,  from  the  inequality 


88  v  II 2 


8x 


IL*()?2\£o) 


88v 

2 

88v 

2  l 

88v 

2  \ 

dp 

+ 

8x 

<_c[ 

8x 

+  |6Q|2  j 

which  holds  at  every  point  in  R2,  and  the  already-used  inequality 

1 88v\ 


II^IIl2(r2\A))  -  \JTi  ~ 


8x 


\  L2  {R2\Rq) 

(Dirichlet  condition  at  x  =  T2!)  follows  the  estimate 

1 88v  I 


whence  from  the  trace  theorem 

||5{j(’,  -Pi)  ||i,2[J0iJ2]  <  C 


8x 


L2(R2\Ro) 


+  II^QI!l,2(K2) 


88v 


8x 


+  II^QIIl2(r2) 


(6.11) 


\L2(R2\Ro) 

Any  point  in  Ri\A3  lies  on  a  curve  of  the  family 

P  *-►  t(z,p) 

which  joins  it  to  the  segment  [To,T2]  x  {Pi},  and  8Q  is  a  first  order  partial 
differential  operator  (acting  on  6i5)  whose  principal  part  is  a  tangent  vector 
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field  to  this  family  of  curves.  Now  (6.11)  follows  from  the  standard  energy 
estimate  for  hyperbolic  systems. 

Combining  (6.11)  with  (6.10)  we  get 


II*5 


~.l|2  < 


I  L2(Ri)  - 


P  + 


C  ((«*)' 

Pr9A&)  v 


+  «A)AJi 


-II 2 


ll*5ll/fi(flo) 


+  C' 


dSv 

11**2  llz,2(H2)  + 

dx 

2 

L2(H2\fl o). 


where  C'  depends  on  /5 ,  a«,  a",  and  r3i»  as  well.  Now  recall  the  conclusion  of 
Theorem  3  which  we  rewrite  in  a  suggestive  way: 


+  (mp)2^  2 


88  v 


dx 


r.ll2 


>  Ll\M\m(Ro)  “  (£o  +  L\)\\Sv\\lHki) 

Evidently  Theorem  4  has  been  proved  if  we  can  make  the  various  choices 
deferred  above  so  that 


(Ll  +  L\) 


0  + 


C 


^s.*(A) 


+  rt*(A)A2 


<L\ 


(6.12) 


To  understand  the  l.h.s.  of  (6.12)  we  first  examine  a  =  max(e3,m*).  We 
relieve  the  reader’s  suspense  by  identifying  the  test  kernel  g  with  the  kernel 
th[  appearing  in  the  statement  of  the  theorem,  so  that  the  issue  becomes  one 
of  choosing  e  >  0.  Note  that  with  this  identification 

ml  =  J  dz\ztK(z)\  =  0{e) 


and  similarly 


es  =  O(e) 


On  the  other  hand,  from  the  definitions, 


r3(A)  =  r*(Z0,  A,  e) 
r3,.(A)  =  r*(Z0,  A,  e) 
r;(A)  =  f(Z0,A) 
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while  Young’s  inequality  gives 


o,g{  A)  <  Cr(Zo,  A) 

provided  that  e  <  A,  say,  so  that  the  support  of  he  is  contained  within  a  fixed 
number  of  A-intervals.  In  fact,  we  shall  adopt  the  convention  that  e  <  C A2 
and  assume  that  A  is  sufficiently  small  that  e  <  A  as  well.  Then 

r«,.(A)  <  C(r,(Z0,  e,  A)  +  r*(Z0,  e,  A)  +  r(Z0,  A)2) 

where  the  constant  now  depends  on  the  test  wavelet  hi  as  well  as  on  — 
this  will  be  the  default  dependence  for  constants  “C”  for  the  rest  of  the 
argument. 

Note  the  uniform  (over  c  €  H*oc  corresponding  to  v  6  ]£)  constraints,  for 
all  e,  A  >  0: 

r,(Zo,e,  A)  <  r*(Zo,e,  A)  <  C 
f(Z0,A)  <  C 

r*(Z0,e,A)  <  Cr*(Zo,  A) 

Choose  a  lower  bound  Mi  and  a  relative  lower  bound  M2,  in  the  state¬ 
ment  of  Theorem  4 ,  subject  to  these  constraints,  and  assume  that  c  satisfies 
condition  (iv)  of  the  theorem  (recall  that  we  have  already  met  the  constraint 
on  Zq ,  viz. 

T0  =  t(Z0,P1)  ). 

That  is, 

Mi  <  r.(Z0,e,  A) 
f(Z0,A)  <  M2r,(Z0,e,  A) 
r*(Z0,  c,  A)  <  M2r*(Z0,  e,  A)  (6.13) 

Then  the  ratio 

C 

raAA) 

is  0(e  +  A2)  =  0( A2)  for  coherent  veJ2  for  which  the  corresponding  velocity 
profile  c  satisfies  (6.13).  Then  we  can  choose  /?  =  0( A)  so  that  the  l.h.s.  of 
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(6.12)  becomes  0(A)  as  well.  In  particular,  for  sufficiently  small  A,  other 
choices  as  above,  the  l.h.s.  of  (6.12)  is 

<  C(L\  +  Li) A  <  i L\ 

whence  finally 

s  +  CMII Ur,)  +  c'  -  -  (6'14) 

The  r.h.s.  of  (6.14)  is  bounded  by  a  multiple  of  the  Hessian  quadratic 
form,  the  factor  now  depending  on  the  penalty  constants  a  and  A  as  well. 
This  completes  the  proof  of  Theorem  4 ■  q.e.d. 

The  proof  of  Theorem  1  is  now  immediate.  Given  appropriate  To,  T),  X2, 
K0,  Ki,K*,  Pi,  Pi,  Co,  Ci,  and  c*,  let  £c  denote  the  collection  of  c  6  //^C(R) 
satisfying 

(i)  c(z)  =  co,  z  <  0 

(ii)  II  log  c|| £,*.(!*.)  <  c*  (<  -  log  P2) 


(iii)  c(z)  =  ci  if  2jT<y/i-T12>r0. 

For  some  l*  =  £(c*), 

II  log  c| | hi [0,z]  <  c*  — ►  ||logu[c](-,p)||ffl[0,iT0]  <  P 


Then  c  £  Ylc  =►  v[c]  £  as  defined  before  the  statement  of  Theorem  4- 

Let  m  be  as  in  the  statement  of  Theorem  3  and  assume  that  /  £  £'(R) 
is  chosen  to  satisfy 


d<b 

K*  4- 


>  /* 


>*  dl 


«.  11*11 


for  <f>  £  /T(R) 


m*  <  m 
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Choose  a  test  wavelet  h\  as  described  above.  Then  choose  Mi, M2,  A, 
and  e  as  in  the  statement  of  Theorem  4  and  define 

Ec  =  ic  €  Ec  :  for  A,  e  with 

0  <  A  <  A,  0  <  e  <  c,  and  Z0  satisfying 


the  following  inequalities  hold: 

Mi  <  r„(Z0,e,  A) 

M2r,(Z0,e,  A)  >  max(f(Z0,  A),  r*(Z0,  e,  A))} 

Then  c  is  constant  for  z  >  Z0,  so  v[c]  is  constant  (for  each  p  G  [Pi,  P2])  f°r 
t  >  r(Zo,p),  whence  a  fortiori  for  f  >  To- 

The  remarks  after  the  statement  of  Theorem  4  show  that,  for  arbitrary 
(but  consistent)  choices  of  the  various  parameters,  the  set  Ec  is  nonempty. 

Finally,  assume  that  in  the  definition  of  Ja<\, 

D  =  S[c]  =  5[u[c]] 

for  c  G  Ec-  Then 

«Wv[c]]  =  0 

while  Theorem  4  shows  that  the  Hessian  of  J^a  at  v[c]  is  positive-definite. 
Therefore  the  implicit  function  theorem  implies  the  existence  of 

1.  an  open  neighborhood  U  in  L2(R\)  of  the  set 

{%]  :  c  G  E'} 

2.  an  open  neighborhood  V  in  Yj  °f  the  set 

{6[c]  :  c  G  Ec} 

so  that  for  each  D  G  U,  the  problem 

minimize^^  a  [6] 
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has  a  unique  solution  v  =  I[D]  €  V ,  which  is  moreover  a  Lipschitz  continuous 
function  of  the  data  D. 

Define  the  averaging  operator 

A--Y.-+ 


by 


A6W  = 


v(T(z,p),p)t 


+  P 2 


-1/2 


Remark.  A  performs  a  version  of  the  operation  “normal  moveout  correction, 
stack”  from  the  reflection  seismic  data  processing  stream:  see  e.g.  Yilmaz 
(1987),  Section  1.4. 

Then  for  c  £  J2c  > 

Av[c]  =  c 

Also,  A  is  Lipschitz  continuous  in  the  topologies  indicated  in  its  definition. 
Set 

/  =  A  o  7  -.U~*  L\oc 

Then  I  has  all  of  the  properties  indicated  in  the  statement  of  Theorem  1.  In 
particular,  for  Di,T>2  G  U, 

||7(Di)  -  I(D2)\\L2[0'Z]  <  L*||Di  -  7?2||i,2(r1) 

for  suitable  L*,  depending  on  the  various  parameters  defining  Ylc  an^  on  /• 
This  completes  the  proof  of  Theorem  1.  q.e.d. 
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7  Non-Elliptic  Sources. 


In  this  section  we  give  a  very  brief  sketch  of  the  state  of  affairs  when  /  is 
smooth.  The  necessary  regularization  arguments  have  become  quite  com¬ 
monplace,  so  we  shall  concentrate  on  the  steps  necessary  to  modify  the  proof 
of  Theorem 

Thus,  suppose  that  /  £  C£°(R):  then  the  best  “near-elliptic”  estimate 
might  have  the  form 


>KX 

LH  R) 


d<f> 

dx 


~  Ko  ||^Hl*(R)  -  ^2 

LHR) 


d2<t> 
dx 2 


L>(  R) 


(7.1) 


for  <f>  £  f/2(R).  The  size  of  Ki  measures  the  “passband”  of  /:  i.e.  if  |/(cj)| 
is  uniformly  large  in  an  interval  fl*  <  |u>|  <  fi/i,  then  K%  =  0(1/Tlh)- 

The  analogue  of  Theorem  3  is 


Theorem  5  Given  P2  >  P\  >  0,  T\  >  To  >  0,  K*  >  K\  >  0,  Ko ,  K2  >  0, 
and  i*  >  0,  there  exist  fh,  Lq,  Li,  L2,  L3,  >  0,  so  that  if  f  £  C^°(R)  satisfies 

(7.1)  and  mlf  <  fh,  then  for  v  £  ^  fl  H(oc(  R),  H2oc(R) 


dSv 

||D5[u]6t;||L2(Rl)  >LX  — 


—  £o||£u||L2(flj) 

I  d2Sv  II 


L2(Ro) 

dSv 


dx 


L*(Ri\Ro) 


-  L? 


dx 2 


The  principal  new  ingredient  in  the  proof  is  the  higher-order  estimate  for 
the  plane-wave  problem 

||£>S,0[t>,£']HU2[o.T]  <  C'2||^t;||/y2[o,l/2T] 

for  log  v  £  i7j2oc(R),  Sv  £  H?oc( R).  See  for  instance  Suzuki  (1988)  for  similar 
estimates. 
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Most  of  the  proof  of  Theorem  4  goes  through  as  before,  except  that  now 
the  smoothness  constraint  implicit  in  Theorem  5  conflicts  with  the  roughness 
conditions.  For  example,  for  a  coherent  v  G  its  corresponding 

c  6  ff^(R)  satisfies 


For  the  constraint  e  =  0(A),  which  we  were  bound  to  impose  this  gives 


<  CA||c//||L2[0iz0] 

So  over  any  bounded  set  in  H2(R2),  r,  is  0(A)  over  coherent  travel-time 
velocities.  Thus  estimates  of  the  sort  developed  in  Section  6  can  only  succeed 
if 


(i)  /  is  sufficiently  “broadband”  that  (7.1)  holds  with  small  K2  relative  to 

Y),K0,Kx,K*; 

(ii)  target  velocities  exist  in  the  intersection  of  ]£'  and  a  sufficiently  large 

ball  in  H2(R2),  for  which  the  regularized  cost  functional 


J.J.M  ■■=  \  {|I$H  -  Bfe(ft) 

+  <'J||C[«]||il(S|)  +  A’l|a81 

dv 


dx 


L2(R2\Fk,) 


+  P‘ 


dx 


L*(R  2). 


takes  a  sufficiently  small  value,  with  p  =  0(K2). 

Then  the  Hessian  of  Ja,\,P  will  be  positive-definite  at  target  velocities  as 
described  in  (ii),  while  the  value  of  Ja,x,p  will  be  small  enough  to  conclude 
the  existence  of  a  nearby  local  minimizer.  Since  it  will  no  longer  be  pos¬ 
sible  to  have  J  =  0,  only  an  approximation  will  be  obtained  even  for  data 
corresponding  exactly  to  c  G  n//1„c(®')* 


7.2 


The  reader  is  referred  to  Symes  (1986b)  for  details  of  a  similar  construc¬ 
tion. 
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